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MATRIX POSITIVITY PRESERVERS IN FIXED DIMENSION. I 


ALEXANDER BELTON*, DOMINIQUE GUILLOTt, APOORVA KHARE*, 
AND MIHAI PUTINARS 


Abstract. A classical theorem proved in 1942 by I.J. Schoenberg describes all real¬ 
valued functions that preserve positivity when applied entrywise to positive semi- 
definite matrices of arbitrary size; such functions are necessarily analytic with non¬ 
negative Taylor coefficients. Despite the great deal of interest generated by this 
theorem, a characterization of functions preserving positivity for matrices of fixed 
dimension is not known. 

In this paper, we provide a complete description of polynomials of degree N that 
preserve positivity when applied entrywise to matrices of dimension N. This is the 
key step for us then to obtain negative lower bounds on the coefficients of analytic 
functions so that these functions preserve positivity in a prescribed dimension. The 
proof of the main technical inequality is representation theoretic, and employs the 
theory of Schur polynomials. Interpreted in the context of linear pencils of matri¬ 
ces, our main results provide a closed-form expression for the lowest critical value, 
revealing at the same time an unexpected spectral discontinuity phenomenon. 

Tight linear matrix inequalities for Hadamard powers of matrices and a sharp 
asymptotic bound for the matrix-cube problem involving Hadamard powers are ob¬ 
tained as applications. Positivity preservers are also naturally interpreted as solutions 
of a variational inequality involving generalized Rayleigh quotients. This optimiza¬ 
tion approach leads to a novel description of the simultaneous kernels of Hadamard 
powers, and a family of stratifications of the cone of positive semidefinite matrices. 
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1. Introduction and main results 

Transformations, linear or not, which preserve matrix structures with positivity con¬ 
straints have been recently studied in at least three distinct frameworks: statistical 
mechanics and the geometry of polynomials PEI [TO]; global optimization algorithms 
based on the cone of hyperbolic or positive definite polynomials pismsHi; the statis¬ 
tics of big data, having the correlation matrix of a large number of random variables 
as the central object P ESllsaiiollle]. The present article belongs in the latter two 
categories, although the main result may be of independent algebraic interest. 

To describe the contents of this paper, we adopt some terminology. For a set 7L C C 
and an integer > 1, denote by Vn{K) the cone of positive semidefinite N x N 
matrices with entries in K. A function / : AT —)• C naturally acts entrywise on Vn{K), 
so that f[A] := {f{aij)) for any A = {aij) G Vn{K). Akin to the theory of positive 
definite functions, it is natural to seek characterizations of those functions / such that 
/[A] is positive semidefinite for all A G Vn{K). A well-known theorem of Schoenberg 
[42] states that f[A] is positive semidehnite for all A G PAr([— 1,1]) of all dimensions 

> 1 if and only if / is absolutely monotonic on [0,1] (i.e., analytic with non-negative 
Taylor coefficients). To put Schoenberg’s 1942 article in historical perspective, we have 
to recall that the theory of absolute monotone functions was already established by 
S. Bernstein |3]. Also, it is worth mentioning that Schoenberg was working around 
that time on the related and more general question of isometrically embedding positive 
definite metrics into Hilbert space; see, for instance, |45j . The parallel theory of matrix 
monotone functions, with f{A) dehned by standard functional calculus, owes its main 
result to Loewner [33] (see also [T2]i. 

Since its publication, Schoenberg’s theorem has attracted a great deal of attention. 
The result has been considered in several different contexts in PElIZlIIIllSgill]. See 
also P da ESI I291136] for recent work. 

Schoenberg’s observation has a natural application to high-dimensional probability 
and statistics. Recall that a correlation matrix is the Gram matrix of vectors on a 
sphere In concrete situations, functions are often applied entrywise to corre¬ 

lation matrices, in order to improve their properties, such as better conditioning, or 
to induce a Markov random field structure; among the recent investigations centered 
on this technique we note [iEniEiiissiEsiEaiiiis]. Whether or not the result¬ 
ing matrices are positive semidehnite is critical for the validity of these procedures. 
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According to Schoenberg’s theorem, functions preserving positivity when applied en- 
trywise to correlation matrices of all dimensions and bounded rank are non-negative 
combinations of Gegenbauer polynomials. However, allowing for arbitrary dimension 
is unnecessarily restrictive, as the state space of the problem is usually known; at the 
least, its dimension often has an apparent upper bound. Motivated by such practical 
demands, characterizations of positivity preserving functions have recently been ob¬ 
tained in fixed dimensions, under further constraints that arise in practice; see, for 
example. 

In the case of a fixed dimension N, obtaining characterizations of functions which 
preserve matrix positivity when applied entrywise remains a difficult open problem, 
even discouraging in view of the scarcity of known results for low rank and low degree. 
Using an idea of Loewner, Horn showed in |3n) that if a continuous function / : (0, oo) —)• 
M satisfies /[—] : Vn{{0,oo)) —)■ VnO^), then / G C^~^{{0,oo)) and f^^\x) > 0 for all 
X > 0 and all 0 < /c < A" — 3. Moreover, if it is known that / G (^^“^((O, oo)), then 
f^^\x) > 0 for all X > 0 and all 0 < A; < A — 1. 

As of today, obtaining an effective characterization of arbitrary functions which 
preserve matrix positivity in a hxed dimension looks rather inaccessible. The main 
result of the present article provides the hrst such characterization for polynomial 
functions, and illustrates the complexity of the coefficient bounds, even in the simplest 
of situations. 

Theorem 1.1. Fix p > 0 and integers A > 1, M > 0 and let f{z) = Ylf=o CjZ^ + c'z^ 
be a polynomial with real coefficients. Also denote by D{0,p) the closed disc in C with 
radius p > 0 and center the origin. For any vector d := {do,... ,dN-i) with non-zero 
entries, let 


N-l 

C{d)=C{d;z^-,N,p) := ^ 

j=0 


M' 

j 


M-j-1 
N-j-I 


^pM-j 


( 1 . 1 ) 


and let c := (cq, ..., ctv-i)- The following are equivalent. 

(1) /[—] preserves positivity on T’n{D{ 0, p)). 

(2) The coeffieients Cj satisfy either cq, ..., cn-i, c' >0, or cq, ..., cn-i > 0 and 
c' > —C(c)“^. 

(3) /[—] preserves positivity on P^((0, p)), the set of matrices in Vj\f{{0,p)) having 
rank at most 1. 


Note that the necessity of having cq, ..., cw-i > 0 in part (2) of the theorem follows 
from Horn’s theorem as stated above. The constant C(c; z^ ] A, p) provides a threshold 
for polynomials that preserve positivity on Vn but not on Vn+i- Our theorem thus 
provides a quantitative version in fixed dimension of Schoenberg’s result, as well as of 
Horn’s result. As should be expected, our bound goes asymptotically to the Schoenberg 
degree-free statement, since C{c; z^■, N, p) —>■ oo as A —>■ oo. It is remarkable that the 
proof of Theorem 11.11 is obtained by using Schur polynomials. 
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Remark 1.2. Notice that if M < then C{c] ] N, p) makes sense and equals CjJ, 
if we use for any complex number z E C the formulas 


\n) n\ 



:= 1 . 


In effect, Theorem 11.11 savs in this case that when / is a polynomial of degree at most 
N — 1, the map /[—] preserves positivity on p)) if and only if all coefficients 

of / are non-negative. 


Theorem 11.11 provides a decisive first step towards isolating classes of functions that 
preserve positivity on Vn when applied entrywise. Additionally, the result yields a 
wealth of interesting consequences that initiate the development of an entrywise matrix 
calculus that leaves invariant the cone Vn, in parallel and in contrast to the much better 
understood standard functional calculus. The next theorem provides a constructive 
criterion for preserving positivity, applicable to all analytic functions. In the theorem 
and thereafter, the N x N matrix with all entries equal to 1 is denoted by IatxaT) and 
A°^ := (cL^j) denotes the feth Hadamard (or entrywise, or Schur) power of A. 

Theorem 1.3. Fix p > 0 and an integer N > 1. Let c := (cq, ... ,CAr_i) E (0, oo)'^, 
and suppose g{z) := Y1m=n is analytic on D{0,p) and continuous on D{t),p), 

with real coefficients. Then 

t{colNxN + ciA + • • • -|- cn-iA°^^ ~ ^ Vn{V.) ( 1 - 2 ) 

for all A E Vn{D{ 0, p)) and all 

t> ^ cmC{c-,z^-,N,p). 

Moreover, this series is convergent, being bounded above by 

2^-\N-l)\^ j ) c, ’ 

where g 2 {z) := 9+{z^) and 

9+{z) ■■= ^ CMZ^. 

M>N:cm>0 

Note that Theorem o concerns the special case of Theorem 11.31 with g{z) = cmz^. 

Theorem 11.31 provides a sufficient condition for a large class of functions to preserve 
positive semidefiniteness in fixed dimension. The loosening of the tight thresholds for 
the individual coefficients is compensated in this case by the closed form of the bound 
for the lowest eigenvalue of the respective matrix pencil. 

Next we describe some consequences of our main results. For A E Vn{K) and / as 
in Theorem ll.il with M > N, note that 

f[A] = cqInxN + ciA + • • • -|- CAr_iA°^'^ + cmA°^, 

where cm = c' . Understanding when f[A\ is positive semidefinite is thus equivalent to 
controlling the spectrum of linear combinations of Hadamard powers of A, by obtaining 
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linear inequalities of the form 

CO^NxN + CiA + • • • + + CmA°^ > 0 , 

where the order is the Loewner ordering, given by the cone Vn{C). A direct application 
of our main theorem provides a sharp bound for controlling the Hadamard powers of 
positive semidefinite matrices. 

Corollary 1.4. Fix p > 0, integers M > N > 1, and scalars cq, ..., cn-i > 0. Then 

A°^ < C(c; z^; N, p) ■ [cqInxN + ciA + • • • + C 7 v-iA°^'^ (1-4) 

for all A E Vn{D{Q, p)). Moreover, the constant C{c; ; N, p) is sharp. 

As an immediate consequence of Lemma 12.41 below, notice that the right-hand side 
of Equation (ll.ip cannot be replaced by a sum of fewer than N Hadamard powers of A. 
Corollary II.41 thus yields a sharp bound for controlling the Hadamard power A°^ with 
the smallest number of powers of lower order. 

In a different direction, our main result naturally fits into the fast developing area 
of spectrahedra [SI HU and the matrix cube problem [37|. The latter, a key technical 
ingredient in modern optimization theory, continues to attract the attention of various 
groups of researchers, mostly in applied mathematics. Recall that given real symmetric 
N X N matrices Aq, ..., Am+i, where M > 0, the corresponding matrix cubes are 

M+l 

l{[ri] := |Ao ^ UmAm : Um G [-i?,??]} (?? > 0). (1.5) 

m=l 

The matrix cube problem consists of determining whether Idlp] C Vn, and finding the 
largest rj for which this is the case. As another consequence of our main result, we 
obtain an asymptotically sharp bound for the matrix cube problem when the matrices 
Am are Hadamard powers. 

Corollary 1.5. Fix p > 0 and integers M > 0, N > 1. Given a matrix A E 
Vn{D{Q,p)), let 

Aq := cqInxN + ciA H--|- 

and 

Am ■■= forl<m<M + l, 

where the coefficients cq, ..., cn-i > 0. Then 

P< ^ U[p]cVn{C) (1.6) 

Vm=0 / 

^ ri <C{c-,z^+^-,N,p)-^. (1.7) 

The upper and lower bounds for p are asymptotically equal as N ^ oo, i.e., 

M 

lim C{c-z^^^-N,p)-^^C{c-,z^^^-,N,p) = 1. (1.8) 

N^oo r. —/ 

m=0 
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See the end of Section [331 for the proof of this result. 

Finally, understanding which polynomials preserve positivity on Vn{K) can be refor¬ 
mulated as an extremal problem involving generalized Rayleigh quotients of Hadamard 
powers. 


Theorem 1.6. Fix p > 0, integers M > N > 1, and scalars cq, ..., cat-i > 0. Then 


: (^N-\ 


inf 

ve/C(A)-L\{o} 






>C{c-z^;N,p) 




\-l 


for all non-zero A G 'Pn{D{D, p)), where 

IC{A) := ker(colAfxA + ciA C]\f—iA°^^ 

The lower bound C{c] z^\ N, p)~^ is sharp, and may be obtained by considering only 
the set of rank-one matrices R^((0,p)). 

We note the surprising fact that the left-hand side of the inequality above is not 
continuous in the variable A] see Remark liTH Motivated by this approach, we obtain, 
in Theorem 15.71 below, a description of the kernel IC{A) for a given matrix A € Vn{^)- 
As we show, this kernel coincides with the simultaneous kernels n„>oker A””’ of the 
Hadamard powers of A, and leads to a hitherto unexplored stratification of the space 
Vn{C). 

The rest of this paper is organized as follows. We review background material in 
Section |2l The main result of the paper is proved in Section |3l along with many in¬ 
termediate results on Schur polynomials that may be interesting in their own right. 
We also show in Section [3| how our main result can naturally be extended to gen¬ 
eral polynomials, and to analytic functions. Section |4| deals with the reformulation of 
our main theorem as a variational problem, and provides a closed-form expression for 
the extreme critical value of a single positive semidefinite matrix. Beginning with a 
novel block-matrix decomposition of such matrices into rank-one components, Section[5| 
provides a description of the simultaneous kernels of Hadamard powers of a positive 
semidefinite matrix. The last section contains a unified presentation of a dozen known 
computations in closed form, of extreme critical values for matrix pencils, all relevant 
to our present work. 


Acknowledgments. The authors are grateful to Bala Rajaratnam for sharing his 
enthusiasm and ideas that attracted the four of us to these topics. We would like to 
thank the American Institute of Mathematics (AIM) for hosting the workshop “Positiv¬ 
ity, graphical models, and modeling of complex multivariate dependencies” in October 
2014, where this project was initiated. We thank Yuan Xu for very stimulating con¬ 
versations at the AIM workshop, which were helpful in embarking upon this project, 
and Shmuel Friedland, for valuable insights into generalized Rayleigh quotients. We 
are indebted to the referee for a careful examination of the contents of the manuscript 
and constructive criticism on its presentation. 


2. Background and notation 

Given a subset K C C and integers 1 < A: < A, let V^{K) denote the set of 
positive semidefinite N x N matrices with entries in K and with rank at most k, and 
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let Vn{K) := V^{K). Given a matrix A, let A°^ denote the matrix obtained from A 
by taking the /cth power of each entry; in particular, if A is an N x N matrix then 
= ItvxV) the N X N matrix with each entry equal to 1. 

Recall that the Gegenbauer or ultraspherical polynomials Cn^\x) satisfy 

OO 

{l-2xt + t‘^)-^ = (A>0), 

n=0 

while for the Chebyshev polynomials of the first kind Cn^ (x) we have 

OO 

(1 — xt)(l — 2xt + ^ Cj{*\x)t"'. 

n=0 

We begin by recalling Schoenberg’s original statement, which classifies positive def¬ 
inite functions on a sphere of fixed dimension. 

Theorem 2.1 (Schoenberg, [1^ Theorems 1 and 2]). Fix an integer d > 2 and a 
continuous function / : [—1,1] —>■ M. 

(1) The function /(cos-) is positive definite on the unit sphere S'^~^ if and only 

if f can be written as a non-negative linear combination of the Gegenbauer or 
Ghebyshev polynomials where X = {d — 2)/2; 

fix) = anC^^\x) {an > 0 ). 

n>0 

(2) The entrywise function /[—] : P 7 v([—Ij 1]) —>• ’Pa(I^) for all N > 1 if and only if 
f is analytic on [—1,1] and absolutely monotonic on [0,1], i.e., f has a Taylor 
series with non-negative coefficients convergent on the closed unit disc D{0, 1). 

For more on absolutely monotonic functions, see the work [3] of Bernstein. Rudin 
[41j proved part (2) of the above result without the continuity assumption, in addition 
to several other characterizations of such a function /. His work was a part of the 
broader context of studying functions acting on Fourier-Stieltjes transforms in locally 
compact groups, as explored in joint works with Kahane, Helson, and Katznelson in 

[SIEI]. 

The work of Schoenberg has subsequently been extended along several directions; 
see, for example, [DElEKnilMlElllS]. However, the solution to the original problem 
in fixed dimension remains elusive when N > 2. 

An interesting necessary condition for a continuous function / : (0, oo) ^ M to 
preserve positivity in fixed dimension has been provided by Horn m- This result was 
recently extended in m to apply in the case of low-rank matrices with entries in (0, p) 
for some p > 0, and without the continuity assumption; on the last point, see also 
Hiai’s work |29j . 

Theorem 2.2 (Horn [SOj, Guillot-Khare-Rajaratnam [H]). Suppose / : / ^ M, where 
I := {0,p) and 0 < p < oo. Fix an integer N >2 and suppose that f[A] G for 

any A G Vff{I) of the form A = oInxN + uu^, where a G (0, p) and u G [0, ^/p — a)^ . 
Then f G C^-^{I), with 

f^^\x) >0 VxGl, 0<A:<iV-3, 
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and is a convex non-decreasing function on I. If, further, f G ^(-^); then 

f(k)^x'^ > 0 for all X & I and 0 < k < N — 1. 

Note that all real power functions preserve positivity on p)), yet such func¬ 

tions need not have even a single positive derivative on (0, p). However, as shown in 
Theorem 12.21 working with a small one-parameter extension of ^*^((0, p)) guarantees 
that is non-negative on (0, p) for 0 < k < N — 3. 

Remark 2.3. Theorem l2.2l is sharp in the sense that there exist functions / : (0, p) —>■ M 
which preserve positivity on Pjv((0,p)), but not on Pjv+i((0,p))- For example, f{x) = 
x", where a G (N — 2, N — 1) is an example of such a function; see [T^ fTTl [29] for more 
details. Thus the bound on the number of non-negative derivatives in Theorem 12.21 is 
sharp. 

In light of Remark 12.31 we focus henceforth on analytic functions which preserve 
Vn{K) for fixed N when applied entrywise. Note that any analytic function on D(0, p) 
that maps (0, p) to M necessarily has real Taylor coefficients. 

Recall by Theorem 12.21 that if /[—] : P^((0, oo)) —)■ RAr(M) and / G oo)) 

then is non-negative on (0, oo) for < k < N — 1. The next lemma shows that if 
/ is assumed to be analytic, then it suffices to work with instead of in order to 
arrive at the same conclusion. 

Lemma 2.4. Let f : D{0,p) M. be an analytic function, where 0 < p < oo, so that 
= 'En>o^rix"' on D{0,p). ///[-] : 7^Ar((0,p)) for some integer N >1, 

then the first N non-zero Taylor coefficients Cj are strictly positive. 

In particular, if / is a sum of at most N monomials, then under the hypotheses of 
Lemma 1231 all coefficients of / are non-negative. 

Proof. Suppose that the first m non-zero coefficients are , ■ • •, Cn „,, where m < N. 
Fix u := (ui,..., Um)'^ £ (0, with distinct entries ui, ..., Um, and note the matrix 

non-singular [m, Chapter XIII, §8, Example 1]. Hence ..., is 

linearly independent and there exist vectors vi, ..., G M™' snch that = 

6j^k for all 1 < j, k < m. Since / preserves positivity on R^((0, p)), it follows that 

0 < e"''''v^/[euu'^]vfc = ^ Cn^ 

j>nm 

as e ^ O"*", for 1 < A: < m. The result follows. □ 

The above discussion naturally raises various questions. 

(1) Can one find necessary and sufficient conditions on a restricted class of func¬ 
tions, such as polynomials, to ensure that positivity is preserved in fixed di¬ 
mension? 

(2) Note that the power functions in Remark 12.31 f{x) = for a G {N — 2, iV — 1), 
are not analytic. Does there exist a function / analytic on an open subset 
U C C which preserves positivity on Vn{U), but not on RAr+i(I7)? 

Our main result provides positive answers to both of these questions. 
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3. SCHUR POLYNOMIALS AND HADAMARD POWERS 


The goal of this section is to prove Theorem ll.il The proof relies on a careful analysis 
of the polynomial 

p{t) := det{t{colNxN + ciA H-h - A°^), 

where A G P^(L)(0,p)). More specifically, we study algebraic properties of the poly¬ 
nomial p{t), and show how an explicit factorization can be obtained by exploiting the 
theory of symmetric polynomials. 


3.1. Determinantal identities for Hadamard powers. We begin with some tech¬ 
nical preliminaries involving Schur polynomials. 

As the results in this subsection may be of independent interest to specialists in 
symmetric functions and algebraic combinatorics, we state them over an arbitrary field 

F. 

Given a partition, i.e., a non-increasing Wtuple of non-negative integers n = (un > 
■ ■ ■ > ni), the corresponding Schur polynomial Sn(a;i, ■ ■ ■ ,xn) over a held F with at 
least N elements is dehned to be the unique polynomial extension to F^ of 

sn(xi,...,x^) ^ (3.1) 

det(Xj j 


for pairwise distinct Xi G F. Note that the denominator is precisely the Vandermonde 
determinant A 7 v(a:i,... ,XAr) := det{x^~^) = ni<i<j<Ar(^i ~ Xj); it follows from this 
that 


Sn(l,... ,2:^ ^) = 

1<2<J<W 


— z* 


Sn(l,...,l)= Yl 


Uj - Ui+j-i 


J - * 


(3.2) 

The last equation can also be deduced from Weyl Character Formula in type A; see, for 
example, [Ml Chapter 1.3, Example 1]. For more details about Schur polynomials and 
the theory of symmetric functions, see [M]. In particular, note that Schur polynomials 
have non-negative integer coefficients, by [Ml Chapter I, Equation (5.12)]. 


Proposition 3.1. Let A := uv'^, where u = {ui,... ,un)'^ and v := (ui,..., vn)'^ £ 
F-^ for N > 1. Given m-tuples of non-negative integers n = [um > n^-i > • • • > ni) 
and sealars (cm, ■ ■ ■ ,Cn^) G F™', the following determinantal identity holds: 

m N 

det^Cn.A°^^' = AAr(u)AAr(v) SA(n')(u)sA{n')(v) (^•3) 

j=l n'Cn, |n'|=V k=l 

Here, A(n') := {n'^ — A -|- 1 > n'^_i — N -\-2 > ■ ■ ■ > n'l) is obtained by subtracting the 
staircase partition {N — 1,..., 0) from n' := (n)y > • • • > n[), and the sum is over all 
subsets n' of cardinality N. In particular, if m < N then the determinant is zero. 


Proof. If there are m < N summands then the matrix in question has rank at most 
m < N, so it is singular; henceforth we suppose m > N. Note hrst that if c := 
and 

^(u, n,c) . (y ^Uj )l<j<N,l<k<7n 
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where we work over an algebraic closure of F, then 

m 

^ Cn. A°"-i = X (u, n, c)X (v, n, c)^. (3.4) 

i=i 

Next, let c|n' := (c„/,... ,c„/ ) and note that, by the Cauchy-Binet formula applied 
to (13.41) . 


det^CnjA°^i = ^ det(X(u,n',c|n/)X(v,n',c|n')'^) 

i=l n'Cn, |n'|=Af 

= ^ detX(u, n',c|n/)detX(v, n',c|n/) 

n'Cn, |n'|=Af 


= det ) det (n”'') . 

n'Cn, |n'|=Af k=l 

Each of the last two determinants is precisely the product of the appropriate Van¬ 
dermonde determinant times the Schur polynomial corresponding to A(n'); see Equa¬ 
tion (13.111 . This observation completes the proof. □ 


We now use Proposition 13.II to obtain an explicit factorization of the determinant of 
p[uv^] for a large class of polynomials p. 


Theorem 3.2. Let i? > 0 and M > N > 1 he integers, let cq, ..cjy-i G be 
non-zero sealars, and let the polynomial 

pt{x) := t{cox^ + • • • + CN-ix^^^-^) - x^+^, 

where t is a variable. Let the hook partition N,j) := {M—N+1, 1,... , 1,0,..., 0), 
with N — j — 1 entries after the first equal to 1 and the remaining j entries equal to 0. 
The following identity holds for all u = (tti,... ,un) and v := (ui,..., vn) G F^.- 


N 


detpi[uv^] = t^ ^AAr(u)AAr(v) 

i=i 


Cj-iufvf 


^ ^u{M,N,j) {'^)s^{M,N,j) (v) 

3=0 ^ 


(3.5) 


Proof. Let A = uv^ and note first that det(^°'^ o B) = ' detB for any 

N X N matrix B, so it suffices to prove the result when R = 0, which we assume from 
now on. 

Recall the Laplace formula: if B and C are N x N matrices, then 

det(R + C)= detMn{B;C), (3.6) 

nC{l,...,iV} 

where M,^{B; C) is the matrix formed by replacing the rows of B labelled by elements 
of n with the corresponding rows of C. In particular, if R = then 

N 

detpt[T] = det{tB — A°^) = t^ det B — t^~^ det M^jy{B; A°^), (3.7) 
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since the determinants in the remaining terms contain two rows of the rank-one matrix 
A°^. By Proposition 13.11 applied with rij = j — 1, we obtain 

det B = A7v(u)Ajv(v)co • • • cw-i- 

To compute the coefficient of note that taking t = 1 in Equation (13.71) gives that 

N 

det A°^) = det B — det [^]. 

i=i 

Moreover, detpi[74] can be computed using Proposition 13.11 with m = N + 1 and 

Cnjv+i = — 1: 

detp,[A] = detB - A„{u)A„{v)c„ ■ ■ ■ ^ 

1=0 

since = \{{M,N - 1,N + l,j,j - 1,... ,0)) for 0 < j < A - 1. 

The identity (13.5|) now follows. □ 

Remark 3.3. The connection between Theorem 13.21 and the constant C{c; ] N, p) 
in Theorem 11.11 stems from the fact that 

, , fM\ , 

P.8) 

for 0 < j < A —1. This is a straightforward application of Equation (|3.2p : alternatively, 
it follows by applying Stanley’s hook-content formula [l3l Theorem 15.3] to the hook 
Schur function p{M, N,j). We also mention a third proof using the dual JacobETrudi 
(Von Nagelsbach-Kostka) identity [Ml Chapter I, Equation (3.5)], which rewrites Schur 
polynomials in terms of elementary symmetric polynomials. The proof goes as follows: 
note that the dual partition of p,{M,N,j) is, up to attaching zeros at the end, the 
(M -|- l)-tuple p'{M, A, j) := (A — j,l,..., 1). Therefore > 1) equals the 

determinant of a 2 x 2 block triangular matrix, one of whose block diagonal submatrices 
is unipotent, and the determinant of the other is computed inductively to yield (|3.8p . 

Corollary 3.4. In the setting of Theorem \3.SX if M = N, then 

N N—1 / \ / \ 

detpi[^] =t^"^AAr(u)AAr(v) ew-j_u_ejv-A^^ ^ 

j=l j=o 

where ej{u) := Yli<ni<---<nj<N'^ni ■ ■ - Unj is the elementary symmetric polynomial 
in A variables of degree j. 

Proof. This follows immediately from the observation that is equal to CN-j] 

see [Ml Chapter I, Equation (3.9)]. □ 

We conclude this part with an identity, which shows how the Schur polynomials 
Si_i{M,N,j) ill Theorem 13.21 can be used to express the Hadamard power A°^^ as a com¬ 
bination of lower Hadamard powers. 
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Lemma 3.5. Fix integers M > N > 1, and let the N x N matrix A have entries in F. 
Denote the rows of A by ai, ..., a^r. Then 

N-l 

=Y^DmM)A°^, ( 3 . 10 ) 

1=0 

where DM,j{A) is the diagonal matrix 

( 1) ^ TV (a^),..., (aTv)^, 

and s^(tu,Aj) ®s in Theorem VJ.A 

Proof. Let v := (ui,..., vn), where vi, ..., vn are pairwise distinct and transcendental 
over F; we now work in the field F(i;i,..., Vn). If V is the Vandermonde matrix {vi~^) 
then, by Cramer’s rule, the solution to the equation 

Af-l 

Vu = (v°^)^ ^ v°^ = ^ iii+iv°^' 

1=0 

is given by setting Ui to equal the Schur polynomial i)(v) times (—1)^ *, to 

account for transposing the ith column to the final place. The result now follows from 
this identity, applied by specializing v to each row of □ 


3.2. Proof of the main theorem. Using the technical results on Schur polynomials 
established above, we can now prove Theorem ll.il 


Proof of Theorem \l.l[ We begin by proving the result when 0<M<iV—1. In this 
case, it follows immediately from Lemma r2.4l that the theorem holds, since C(c; z^;N, p) 
= by Remark 11.21 

Now assume M > N. To be consistent with the statement of the theorem, where 
the floating coefficient is denoted by c', for M > N we adopt the unifying notation 
Cm = c'. Clearly (1) implies (3). We will now show that (3) implies (2), and (2) implies 
( 1 ). 

(3) (2). Suppose that /[—] preserves positivity on P^((0,p)). By Lemma [2.41 

the first N non-zero coefficients of / are positive, so we suppose cm < 0 and prove 
that Cm > —C{c]z^]N,p)~^. Define pt{x) as in Theorem 13.21 with R = 0, and set 
t ■= |ctw|~^) so that |cTVf|~^/[—] = Pt[—] preserves positivity on rank-one matrices 
A = uu^ with u G (0, ^/p)^. Then, by Equation (I3.5p . 


N —1 / \2 

0 < detpiPuu"^] = t'^“^ATv(u)^co • • • ctv-i (^t — ^ ^ (3.11) 

j=o 

Now set Uk ■= v/p(I “ with G (0,1) pairwise distinct and t' G (0,1), so that 
Atv(u) 7 ^ 0. Taking the limit as t' —>■ 0, since the final term in Equation (13.111) must 
be non-negative, we conclude that 


N-l 


4 . \„ 1-1 \ ■ ■ ■ , v/p)^ ^ ^\ 2 P ^ ^ 

t=\CM\ >2^ --- = 2^ S^(tu,7VJ)(1,---,1) —— 


i=o 


i=o 


= C(c;z^;V,p), 


(3.12) 
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as claimed. 

(2) (1). Suppose (2) holds. Note that (1) follows if cj > 0 for all j, by the Schur 

product theorem, so we assume that cq, ... , cn-i > 0 and — C(c; ;N, p)~^ < cm < 0. 
We first show that /[—] preserves positivity on For 1 < m < let 

Cm--= V ,1)^^--= C (c^; 2:^"^+""; m, p), (3.13) 

j =0 C-N-m+j 

where := ■ ■ ■ ,cn-i)- Note in this case that Cn is precisely C{c-, ] N, p) 

as defined in Equation (|l.ll) (and N, M are fixed). It follows from Theorem 13.21 that 
Cl = /cn -1 and, for 1 < m < — 1, 


Cm+l Cn 


m—1 

— ^ ^ {•^ii{M—N+m+l,m+l,j+l)0-j ■ • ■ ) 1 ) 
1=0 
> 0 , 


2\P‘ 


^fj,{M—N+m+l,m,j){^^ • • • ) 1) ) 


m+M—N—j 


CN—m+j 


since 


7V+m+l,m+l,j+l) (1-5 • • • ; I) 
^fi{M ■ ■ ■ ) f) 


I'm + 1 + M — N\ Pm + M — N\ ^ 

V j + 1 A j ) 

m + 1 + M — N 


Thus 0<Ci<C'2<---<C'Ar. 

Next, we claim that for 1 < m < N" and all A = uu* G T’j^{D{0, p)), every principal 
m X m submatrix of the matrix 


L := Cm{cN-m'i-NxN + CN-m+lA H-+ ^^) - (3.14) 


is positive semidefinite; for m = N, this gives immediately the rank-one case of (1). 

The proof of the claim is by induction. The case m = 1 is immediate, since a general 
diagonal entry of L for m = 1 equals CiCat-i — > 0. 

Now suppose the result holds for m — 1 > 1. In the remainder of this proof, we adopt 
the following notation: given a non-empty set n C {1,..., A^} and an N x N matrix 
B, denote by the principal submatrix of B consisting of those rows and columns 
labelled by elements of J; we adopt a similar convention for the subvector Un of a 
vector u. If n C {1,..., A^} has cardinality m then, by Theorem 13.21 with v = u^, 

det L„ = C™->|A„(u„)| 2 n (C„ - V , 

J=l J=0 


Using the triangle inequality in C and the fact that the coefficients of any Schur poly¬ 
nomial are non-negative, it follows immediately that 


\s^{M — 7V+m,m, j)(Un)| ^ Sij,{M—N+m,m,j){'\/Pi ■ ■ ■ i y/p) 

_ „ /-I i\2 m+M—N—j 

^fj.{M—N+m,m,j)Ai ■■■OP i 


(3.15) 
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cHid so, by the choice of Crm the determiriciiit det Ln ^ 0 for ciiiy set n d iV} of 

cardinality m. Additionally, for each non-empty subset n of cardinality k < m, 

+ ... + c„_i - Af 

> o + ■ ■ ■ + 

since Cm > C'fc- It follows by the Schur product theorem and the induction hypothesis 
that det > 0 for any non-empty subset n of cardinality k < m. Hence all principal 
m X m submatrices of L are positive semidefinite, which concludes the proof of the 
claim by induction. In particular, /[—] preserves positivity on p)). 

We now prove that (1) holds for matrices of any rank; again we proceed by induction. 
For N = 1, the result holds because Vi{D{0, p)) = Vl{D{0, p)). Now suppose (1) holds 
for — 1 > 1, and let 

Pt[B; M, d] := t{dol + diB + ■ ■ ■ + dn-iB°^^-^^) - 

for any square matrix B of arbitrary order, where t is a real scalar and n is the length 
of the tuple d = {do,... ,dn-i)- It suffices to show that pt[A]M — A^,c] >0 for 
all t > C{c-, ; N, p) and all A = (aij) G Vn{D{^, p)). 

By a lemma of FitzGerald and Horn [T31 Lemma 2.1], if u G C'^ is defined to equal 
{diN/y/cLNN)^i when unn / 0, and is the zero vector otherwise, then A — uu* is 
positive semidefinite and its final column and row are both zero. Furthermore, the 
principal minor 


OiN (Inn 


= auaNN — loiTVp > 0, 


so the entries of uu* lie in D{0,p). Since 

f{z)-f{w)=[ {z-w)f{Xz + {1 - X)w)dX {z,wgC) 

Jo 

for any entire function /, it follows that 

Pt[A-,M—N,c] = pt[uu*; M—N, c]+ ( (A—uu*)oMp^/;^^[AA-|-(l—A)uu*; M—A^, c'j dA, 

Jo 

(3.16) 

where the {N — l)-tuple c' := (ci, 2c2,..., {N — l)cAr-i)- Now, since A — uu* has last 
row and column both zero, the integrand in (13.161) is positive semidefinite if the matrix 
Pi/mI^X: M — N, c'j is, where Ax G Vn-i{D{ 0, p)) is obtained by deleting the final row 
and column of AA -|- (1 — A)uu*. Thus, if we show the inequality 

C{c;z^;N,p) > M ■ C{c'-, z^-^; N - 1, p), (3.17) 

then both terms in Equation (|3.16l) are positive semidefinite for all t>C{c-z^-N,p), 
by the induction hypothesis, and this gives the result. 
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Finally, we prove (13.1711 : note that 


l,p) 



M-l-j- 

iV-l-j- 


2 pM-l-j 
{j + l)Cj+i 



M - (j + 1) - 1\ 


2 pM-(i+l) 

Cj+l 


3 + 1 
M 


^ vm - j - ly ^ _ i_ 

^ V 7 / W - j - ly Cj M 

<Cic-,z^-,N,p). 


Thus (2) implies (1), and so concludes the proof of the theorem. 


□ 


Remark 3.6. Theorem 11.11 shows further that, for any subset K <zC satisfying 

(0,p) C KcD{0,p), 

the polynomial f{z) := co+ci 2 ; + - • ■+cn-iz^~^+cmz^ preserves positivity on Vn{K) 
if and only if (cq, ..., CAr_i, cm) satisfies Theorem 11.11 2). More generally, given any 
class V of matrices such that 'P^((0,/?)) <ZV <Z Vn{D{Q, p)), Theorem 11.11 implies that 
/[—] preserves positivity on V if and only if (cq, ..., cat-i, cm) satisfies Theorem [LT]^ 2 ). 
Similarly, Theorem 11.11 shows the surprising result that to preserve positivity on all of 
T’n{D{ 0, p)) is equivalent to preserving positivity on the much smaller subset of real 
rank-one matrices p)). 


3.3. Positivity preservers: sufficient conditions. At this point, extensions of The¬ 
orem [LT] to more general classes of functions are within reach. We first introduce some 
notation. 


Definition 3.7. Given K G C, functions g, h : K ^ C, and a set of positive semi- 
definite matrices V C Uv=i^^('^)’ C{h;g-,V) be the smallest real number such 

that 

g[A]<C{h-,g-,V)-h[A], ^AgT. (3.18) 

In other words, the constant C{h',g-,'P) is the extreme critical value of the family of 
linear pencils {g[A\ — M/i[A] : A G V}. If hc{z) := cq -|- ciz is a 

polynomial with coefficients c := (cq, ... , c^-i), then we let 

C{c]g-,V) := C{hc;g;'P) 

to simplify the notation. Similarly, if P = T’]y{D{0, p)) for some p G (0, oo) and an 
integer > 1 , then we let 

C{h;g;N,p) := C{h-, g]VN{D{0, p))). 

Finally, let 

C{c;g-,N,p) := C{co H-h 5 ;PAr(-D(0, p))) 

for p G (0, 00) and integers n, N >1. 
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Note that the notation introduced in Definition 13.71 is consistent with the notation 
used in the previous sections of the paper. Also, note for future use that the constant 
C{h] g; V) is non-increasing in the last argument, as well as M+-subadditive in the second 
argument: 


C{h] Xigi + ■ ■ ■ + XnQn'j'P) gj]V), VAi,..., > 0. (3.19) 

j=i 

We now show how Theorem 1 1.1 1 can naturally be extended to arbitrary polynomials. 

Corollary 3.8. Fix a bounded subset K G C, and integers N >1 and M > 0. There 
exists a universal constant ^ depending only on N, M, and K, with the 

following property: for any integer R> 0 and any polynomial f{x) = x^Ylk=o^ 
with real coefficients such that 

(1) Co, ..., cn -1 > 0, and 

(2) minjcfc :0<A:<iV-l}> ' max{|Q| : q < 0}, 

we have /[—] : Vn{K) —)• V]\f{C). 

Proof. Without loss of generality, we may assume R = 0; the general case follows from 
the Schur product theorem. Now fix p > 0 such that K C D(0,p), and let 


M 

^nMk) ■= Y. ^((1’ • • • ’ 1); p') ^ (O’ (3-20) 

m=0 


where the 


M] 


first argument has N ones. Now given A E Vj^iK), one finds 

IV-l N+M N-1 N+M 

= CjA°^ + CjA°^ > min Ck y^ A°^ + m i n ci y^ A°T 
^ 0<fc<IV-l ^ l>N:ci<0 ^ 

j=0 j=N - - j=0 - j=N 


The result now follows immediately from Equation (I3.19|) . 


□ 


Next we show how our main result extends from polynomial to analytic functions. 

Proof of Theorem 11.31 Without loss of generality, we may assume that cm > 0 for all 
M > N, and, via a standard approximation argument, that the series in the statement 
is analytic in the disc D{0, p + e), where e > 0. 

The first part is immediate from Theorem 11.11 and Equation (j3.19p . To establish the 
bound (11.30 . note first that, for 0 < j — 1, we have 

fM -j -l\ _ Ml 

\N-j-l) ~ j\{M - N)\{N - j - 1)\{M - j) 

Ml 

- jl{M - N + l)l{N -j - 1)! 



M' 


IV- 1 


' M 
N-1 
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Using the above analysis and Tonelli’s theorem, we compute: 

OO 

^ cmC{c-, ] N, p) 


M=N 


N-1 jy_j OO 


f f ^ ~ j ~ M-N 


j=0 ^ M=N 

1 

-(iV-l)!2 A. 


V J 

N -lyp 


N-j-l 


p 


E 


N 


M-N 


< 


1 


E 


(iv - ir 


2-{n-i) 


^ y ''3 m=n 

n-iYp' 


N—l 2 ^N—j _oo 


E 


CMP. 

k=2 

(2(M -N) + k + 1)(2M -N + k) 


CMP 


M- 


y M=N 
N—l / ,r 2 V—c — l OO 


(iV-l) 


1)12 


i=o 


N -lY 


J 


k=2 
N 

n 

k=2 

27V-3 


Ci 


X] CM n 


2M-2V+2 


M=V k=0 


Notice that g 2 is analytic on D{0, since g is analytic on D{0, p + e). Therefore 

the inner sum is precisely the {2N — 2)th derivative of g 2 iz) = g+{z‘^), evaluated at 
z = Yp. This concludes the proof. □ 

Remark 3.9. The quantity g^ iy/p) can be written in terms of the derivatives of 
g+ at z = p; it may be shown by induction that 

Ln/2J 

[g+Y)) = ^ 


dn 


n\ 


{2x)^-‘^'^g^^ (n>0). 


A:=0 


(n — 2k)\k\ 


(3.21) 


This shows that one has explicit bounds for C{c;g‘,N,p) < C{c; g^; N, p) in terms of 
the derivatives of g+ at p. 

We conclude this part by showing how Theorem 11.11 yields an asymptotically sharp 
bound for the matrix cube problem for Hadamard powers. 

Proof of Corollary Note that Uig] C VnY) if and only if 
M+l 

^o~'n ~ colAfxAf + p4 + • • • +C]\i-iA°^^ — g{A°^ + • • • g Pjv(C). 

m=l 

Thus the first implication follows by Theorem 11.31 The other implication follows by 
setting ui = ■ ■ ■ = um = 0 in Equation (|1.5p and applying Theorem 11.11 

It remains to show the asymptotics in (11.81) . For this it suffices to show that 




(0 < m < M — 1). 


(3.22) 


N^oo C(c; z^+^;N, p) 

In turn, to show (|3.22p we first fix N and p, and write out each summand in the 
numerator and denominator of (I3.22p as follows: 

'N + mY fN + m- j - lY p^+^-^ 

N-j-l 


a(m,j) := 
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Next we bound the ratios of the summands for fixed N, m, M, and p, and for 0 < j < 
N-l: 

a{m,j) _ /(iV + m)!M!(A^ + M- 


a{M,j) \ ml{N + M)\{N + m — j) 


< 


< 


1 ( 

{N + M)2 

■■■{N + m + l)‘^\ 

1 / 

'Ml N + M-j\ 

jY2(M-m) \ 

.ml N -\- m — j ) 

1 ( 

'(M -1- !)l\^ ^m-M 

j^2(M-m) \ 

.{m + l)l) ^ 


P 


m—M 


where the final step follows from the observation that if M > m > 0 are hxed, and 
N — j = a > 1, then = 1 + has its global maximum at a = 1. Now let 




m—M 


(m + 1)! 

so that a{m,j) < a{M, j)b{m, M, for 0 < j < iV — 1. Hence 

0 < Cjc; Z^+^-,N,p) )j^-2(M-m) ^ 0 

- C(c;z^+^;A^,p) - ^ 
as N" ^ oo, as required. 


□ 


3.4. Case study: 2x2 matrices. Remark that Theorem 11.11 holds for any integer 
N > 1. When = 2, it is possible to prove a characterization result for polyno¬ 
mials preserving positivity on V 2 , for a more general family of polynomials than in 
Theorem 11.11 Along these lines, we conclude the present section with the following 
result. 

Theorem 3.10. Given non-negative integers m < n < p, let f{x) = CmX^ + Cnx"' + 
Cpx'P, with Cm and Cn both non-zero. The following are equivalent. 

(1) The entrywise function /[—] preserves positivity on P 2 ([ 0 , 1]). 

(2) The entrywise function /[—] preserves positivity on !])• 

(3) Cm, Cn> 0, and 

^ ^ -CmCnjn - nif 

^ ~ Cm{p - mY + Cn{p - n)2 ■ 

Note that if Cp < 0 then such an entrywise function does not preserve positivity on 

mo,!]). 

In the special case m = 0 and n = 1, note that the bound — -cmCn{n-m) — reduces 
to the constant —C(c;p; 2,1)“^. 

Proof. Clearly (1) implies (2). To see why (2) implies (3), note that Cm, c„ > 0 by 
Lemma lrH Now let ui, U 2 € [0,1] be distinct and let fmiui,U 2 ) := —uff)/{U 1 —U 2 ) 
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denote the divided difference of their mth powers. With u = {ui,U 2 )'^, 

0 < det /[uu'^] = f{ul)f{ul) - f{uiU 2 f 

= CmOn{uiU2f^{ur^ - + C^Cp{mU2f^ 

+ CnCp{uiU2f"‘{ul~"' - 


which implies that 

iui,U2)'^ _ 

“ Cmfp-m{ui,U2Y + Cn{uiU2Y‘^'^~^^ fp-n{ui.,U2Y 

Letting ui ^ 1 = U 2 yields (3). 

Finally, suppose (3) holds. Then (1) holds if and only if f{xyY < for all 

X, y & [0,1] and / is non-decreasing and non-negative on [0,1]; see, for example, [TSl 
Theorem 2.5]. By [38l Exercise 2.4.4], the first of these conditions is satisfied if and 
only if the function 

T/(x) := x(/"(x)/(x) - f'ixf) + f{x)f{x) > 0 
for all X G (0,1). A short calculation gives that 

T/(x) = x"*+"-“^(cmCn(n “ rn)'^ + CmCp{p - mfxP~'^ + CnCp{p - nfxP~"^), 


and so 


> 0 




-CmCn{n - mf 


Cm{p — m)‘^xP~'^ + Cn{p — n)‘^xP~'^' 


The final term has its infimum on (0,1) when x = 1, so we obtain the first condition. 
Next, note that / is non-decreasing on [0,1] if and only if 

/'(x) = mcmx^~^ + ncnx'^~^ +pcpx'^~^ >0 Vx G (0,1) 

—mCmX^~P — nCnX"'~P 




Cp ^ 


p 

TTlCm. 


Vx G (0,1) 


P 


Thus, it suffices to show that 

mcm + ncn 

p 


> 


CmCn{n - mf 


Cm{p - mf + Cnip - nf ’ 


but this holds because 


CmCn{m{p — nf + n{p — mf — p{n — mf) = CmCn{m + n){p — m){p — n) > 0. 

Thus (3) implies that / is non-decreasing on [0,1]. In turn, this implies that / is 
non-negative on [0,1], since /(x) > /(O) > 0 for x G [0,1]. Hence (3) implies (1). 

The final assertion is an immediate consequence of Lemma 12.41 □ 
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4. Rayleigh quotients 

Recall from Theorem 11.61 that Theorem ll.il can be reformulated as an extremal prob¬ 
lem, involving the boundedness of the generalized Rayleigh quotient for 
and A°^, taken over all matrices A E Vn{D{ 0, p)). We now consider an alternate 
approach to proving Theorem 11.11 by first considering the analogue of Theorem 11.61 for 
a single matrix. In this case a sharp bound may be obtained as follows. 


Proposition 4.1. Fix integers N > 1 and M > 0, positive scalars cq, ■ ■ ■, cn-i > 
0, and a non-zero matrix A E Vn{C). Then, with the notation as in Theorem, \l.(i\ 
and Definition [13 K,{A) C kerA°^, and the corresponding extreme critical value 
C{c] ] A) is finite: 


C{c-,z^-,A)-^ = 


mm 






v6S2Jv-inA:(A)-i 


* j^oM 


where ^ is the unit sphere in . 


In particular, 

.N-l . 

V* ( CjA°^ U>C{c-,z^-,A)-^- v*A°^v 
^j=o ^ 

for all V E C^. 

Note, moreover, that the minimum in Proposition 14.11 is attained for every non-zero 
matrix A E Vn{C) and integer M > 0, and over a compact set that is independent 
of M. 

The proof of Proposition 14.11 as well as a closed-form expression for the constant 
C{c; z^; A), is immediate from the following two results. 

Proposition 4.2. Given N > 1 and C, D ^ Pjv(C), the following are equivalent. 

(1) Ifv*C'v = 0 for some v E , then v*Zlv = 0. 

(2) ker C C ker D. 

(3) There exists a smallest positive constant \:}c,d such that v^Cv > i)c,D ■ v*Zlv 
for all V E C^. 

Moreover, if (l)-(3) hold and D 0, then the constant is computable as an extremal 
generalized Rayleigh quotient, as follows: 

flc,D= sup ^^ = g{C^/^DC^/^) = giX*C^X), (4.1) 

v^ker D ^ F'V 

where and E Vn{C) denote respectively the square root of the Moore-Penrose 
inverse and the Moore-Penrose inverse of C E Vn{C), q{—) denotes the spectral radius, 
and X is any matrix such that D = XX*. 

Proof. Clearly (3) implies (1). That (1) is equivalent to (2) is also immediate, given 
the following reasoning: 

v*Cv = 0 ^ = 0 

^ C'/V = 0 


Cv = 0 


v*Cv = 0 . 
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We now show that (1) implies (3). Given a matrix C, denote its kernel and the orthog¬ 
onal complement of its kernel by Kc and respectively, li^v ^ Kd then any choice 
of constant i}c,D would suffice in (3), so we may restrict ourselves to obtaining such a 
constant when v ^ Kd. Write v = vc -|- v^, with vc G Kq and G K^, and note 
that 7 ^ 0. Now compute: 

^ (vc)*Gvc ^ (vc/l|vcll)*C'(vc/l|vcll) 
v*Dv (v^)*Dv^ (vc/l|vcll)*^(yc/l|ycll)' 

It follows by setting w := vc/l|ycll that 


hc,D 


V Cv w Cw 

ini ——— = min -——, 


(4.2) 


where denotes the unit sphere in . The right-hand side is the minimizer 

of a continuous and positive function over a compact set, hence equals a positive real 
number. Thus i}c,D > 0, as desired. 

We now establish Equation (I4.1ji . by following the theory of the Kronecker normal 
form for a matrix pencil, as developed in m Chapter X, §6]. Because both sides of 
the last equality in Equation (j4.1h remain unchanged under a unitary change of basis, 
we may assume that C is diagonal, say C = diag(Ai,..., A,., 0,..., 0) with Ai > • • • > 
\r > 0. Pre-multiplying by = diag(A}^^,..., Ay^, 0,..., 0) preserves \ {0}, 
and therefore, setting w = 

_i v*Dv (v^)*D^r^ 

n= sup = sup , V, ^ V = s^P - 1 - 

v^kerDV^Cv v<fGA^\{0} (Vc)*<^yC wGA^\{0} ^ ^ 

= sup w* 

wg52iv-initi 

< sup = g{C^/^DC^/^). 

wg52y-i 


Moreover, any non-zero eigenvector of with non-zero eigenvalue has its last 

n — r coordinates all equal to zero, since = diag(A]^ , A^ 0,..., 0). It fol¬ 

lows that the unit-length eigenvectors corresponding to the eigenvalue q{C'^^^DC'^^^) lie 
in K^, thereby proving the result. The last equality in Equation (14.11) is an immediate 
consequence of the tracial property of the spectral radius. □ 

Lemma 4.3. Fix integers m > N > 1 and M >0, and matrices A, B G Vn{C). The 
matrices h{A, B) := A°^ and g{A, B) := BoA°^^ are such that ker h{A, B) C 

ker g{A, B). 

Proof. Suppose v G kevh{A,B), so that v*h{A, B)v = 0. Since B o A°^ G Vn{C), it 
follows that v*{B o A°^)v = 0 and {B o A°^)v = 0 for 0 < j < m — 1; in particular, 
the lemma is true if M < m. If, instead, M > N, then apply Lemma 13.51 together 
with the fact that D{C o B) = (DC) o B if B, C and D are square matrices, with D 
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diagonal, to see that 

h{A, B)v = 0 ^ {A°^ o B)v = 0 (0 < j < iV - 1) 

N-l 

J2DM,j{A){A°^ oB)v = 0 {A°^ o B)^r = 0. a 

j=o 

Remark 4.4. Note that Theorem 1 1. 61 does not hold in much greater generality, i.e., for 
more general functions g, h than the polynomials used in Lemma [4.31 For instance, fix 
(5 > 0 and consider continuous functions g, h : [0, <5) —)• [0, oo) such that g{0) = h{0) = 0 
and g{x), h{x) > 0 on (0,5). Given N >2 and f G R, let the N x N matrix A{t) := 
diag(t, 1,... , 1). There need not exist a universal constant 1) > 0 such that 

^r'^h[A{t)]v > f) • v'^g[A{t)]^r Vt G (0,5), v G R^. (4.3) 


To see this, note that A{t), h[A(t)] and g[A{t)] are positive definite for t G (0,5) and 
singular at f = 0. Thus, by standard facts about the generalized Rayleigh quotient, if 
(j4.3n holds, then 


f) < inf 


inf 


^h[A{t)]v 


te(o,s) veM^\{o} 'v'^g[A{t)]v 


ini X^Mm]-^^"h[A{t)]g[A{t)]-^/^) 
t€(0,S) 

inf mm{l,h(t)/g(t)}, 
te(o,<5) 


where Amin(C') denotes the smallest eigenvalue of the positive semidefinite matrix C. If 
h{t) = t^g{t) for some a > 0, then this infimum, and so 1), equals zero. Note, however, 
that such a case does not occur in Theorem II.61 


We now observe that Proposition 14. II and the explicit formula for the extreme critical 
value of the matrix pencil, as given in Equation (|4.ip , allow us to provide a closed-form 
expression for the constant C(c; z^; A) for rank-one matrices. 


Corollary 4.5. Given integers N >1 and M > 0, positive scalars cq, ..., cw-i > 0, 
and a matrix A = uu* G P^(C) \ {Oatxat}; identity 


.N-i . I 

C(c; z^-,A) = ^ j u°^ 

holds. In particular, given p > 0, 

.N-l V -1 

C{c-,z^]P1nxn) = j - 


'i=o 


with equality if and only if N = 1. 

Proof. Applying Proposition 14.11 and Equation (14.11) with C = hc[A] := Ylf=o 
and D = A°^ = u°^(u°^)*, we obtain 

C{c-z^-A) = ^(vv*), where v = h^[uu*]'^/^u°^. 
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Now, it is well known that the unique non-zero eigenvalue of vv* equals 

/JV-l \ t 


V V = 


{u°^yh^[A]^u°^ = ^ Cju°yu°y* ] u°^, 

Vj=o '' 


which shows the first assertion. For the next, set u = ^/p{l, ■ ■ ■ , 1)*, and use the fact 
that (auu*)! = a ^(u*u) ^(uu*) for non-zero a and u, to obtain that 


N-l V -1 


C{c; ] pInxn) = 


j=0 

It remains to show the last inequality. When = 1, 

^-1 pM 


C(c; z^-N, p) ^ CjfA = - co = p 

j=o 


M 


If instead > 1, then 


N-l 

C(c; z^-N, p) ^ Cjp> > ^ 
i=o i=o 




□ 


Remark 4.6. If ^ = uu*, with u having pairwise-distinct entries, then hc[A\ is the 
sum of N rank-one matrices with linearly independent column spaces, and hence is 
non-singular. In particular, it is possible to write C(c; z^^] A) for such A in an alternate 
fashion: if hc{z) = where cq, ..., ctv-i > 0 then 


C(c; z^; A) = (u°^^^)*/ic[uu*] = 1 — det /ic[uu*] ^ det 


,oM\*u 




(uoM)* 1 I • 


For the optimization-oriented reader, we now restate the main result as an extremal 
problem that follows immediately from Theorem 11.61 and Proposition 14.11 


inf 


mm 




A&pj^({o,p)) vGS2iv-in/c(A)J 


inf min 

A&Vn(D{0,p)) VGS2JV-In/C(A)- 


v*A°My 


N-l 2 /,.r i\2^M-j\-l 


E 

j=0 


My fM -j -ly y 

j) U-j-i. 


(4.4) 


or, equivalently. 


sup C{c;z^;A)= sup C{c-, z^; A) = C{c-, z^ ] N, p), (4.5) 

Mrjfao,p)) a&Vn(d{d,p)) 

where C{c;z^;A) = g{hc[Ay/‘^A°^hc[Ay/‘^), with hc{z) = CjzA 
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Remark 4.7. Note from the previous line that A i—)• C{c\ ] A) is continuous on 
the subset of the cone Vn{^) where det/ic[A] / 0. The obstacle to establishing (14.411 
and ()4.5I1 , and so proving Theorems 11.11 and 11.61 resides in the fact that this function is 
not continuous on the whole of Vn{^) for > 1. In particular, it is not continuous at 
the matrix A = pl^xN ^ p)), as shown in the calculations for Equation (j.S. 12 p 

and Corollarv l4. 51 above. However, these calculations also reveal that the sharp constant 
C{c-, ; N, p) is obtained by taking the supremum of the Rayleigh quotient over the 

one-parameter family of rank-one matrices 

{pu{t)u{t)'^ : u{t) := (1 - t,..., 1 - Nt)'^, t G (0,1/A^)}. 

5. The simultaneous kernels 

Prompted by the variational approach of the previous section, a description of the 
kernel }C{A) = ker(colArxiv + ciA -|- • • • 4- for any A G Vn{C) is in order. 

As we prove below, this kernel does not depend on the choice of scalars cj > 0, and 
coincides with the simultaneous kernel 

Pi kerA°’" 
n>0 

of the Hadamard powers of A. A refined structure of the matrix A, based on an analysis 
of the kernels of iterated Hadamard powers, is isolated in the first part of this section. 

5.1. Stratifications of the cone of positive semidefinite matrices. We introduce 
a novel family of stratifications of the cone PAr(C), that are induced by partitions of the 
set {1,... ,N}. Each stratification is subjacent to a block decomposition of a positive 
semidefinite N x N matrix, with diagonal blocks having rank one. In addition, the 
blocks exhibit a remarkable homogeneity with respect to subgroups of the group C^, 
the multiplicative group of non-zero complex numbers. 

Theorem 5.1. Fix a subgroup G C C^, an integer N > 1, and a non-zero matrix 
AgVn{C). 

(1) Suppose {Ii,..., Ik} is a partition o/{l,...,A'} satisfying the following two 
conditions. 

(a) Each diagonal block Aj. of A is a submatrix having rank at most one, and 
Aj. = UjU* for a unique n^ G with first entry Ujq G [0, oo). 

(b) The entries of each diagonal block Aj. lie in a single G-orbit. 

Then there exists a unique matrix G = such that Cij = 0 unless Uj 7 ^ 0 

and Uj 7 ^ 0, and A is a block matrix with 

Ai^^j. = djUiU* {l<i,j<k). 

Moreover, the entries of each off-diagonal block of A also lie in a single G-orbit. 
Furthermore, the matrix G G Pfc(Z?(0,1)), and the matrices A and G have equal 
rank. 

(2) If the following condition (c) is assumed as well as (a) and (b), then such a 
partition {Ii,... ,/fc} exists and is unique up to relabelling of the indices. 

(c) The diagonal blocks of A have maximal size, i.e., each diagonal block is 
not contained in a larger diagonal block that has rank one. 
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(3) Suppose (a)-(c) hold and G = C^. Then the off-diagonal entries of C lie in 
the open disc D{0, 1). 


In particular, given any non-zero matrix A E 'Pjv(C), there exists a unique partition 
{Ii,..., Ik} of { 1 ,... , A^} having minimal size k, unique vectors Uj G C^h\ with G 
[0,oo), and a unique matrix C G 1)), such that yl is a block matrix with 

= CjjUjut whenever 1 < < /c, and A has rank at most k. 


Proof. 

(1) Suppose I < i j < k, and l<l<l'<m<N, with I, f G f and m G Lj; the 
submatrix 


a ag 6 ' 

B := = I ag a\g\‘^ c 1 , 


dj 


where a, d > 0, g G G, and 6 , c G C. 

We claim that c G b ■ G, and that the minor 


a b 
ag c 


is singular. Now, 


0 < det B = —a(|cf -|- \bf\g\ — 2Re( 6 c 5 )) = —a|c — 


r,TTi2 


so either a = 0, in which case 6 = c = 0, by the positivity of B, or c = bg. This 
proves the claim. 

Applying this result repeatedly shows that every 2x2 minor of the block 
matrix with at least two entries in the same block, is singular, and 

the entries of any off-diagonal block lie in a single G-orbit. Thus there exists a 
unique Hermitian matrix C such that all assertions in the first part hold, except 
for possibly the claim that C G 'Pk{D{t), 1)). 

Fix any vector v G C^, and choose vectors Wj G such that u*Wj = vj if 
Uj 7 ^ 0, and arbitrarily otherwise. Define w := (wj,... ,w^)* G C^, and note 
that 

k k 

0 < w*Aw = ^ w*CijUiU*Wj = ^ vfCijVj = v*Cv, (5.1) 

i,j=l i,j=l 


so C G Rfc(C). Consequently, the entries of G are in L)(0,1), by a positivity 
argument, because the diagonal entries of G are all 0 or 1 . 

It remains to show that A and C have the same rank. Let J := {j G 
{ 1 ,..., /c} : Aj. / 0} be the set of indices of non-zero diagonal blocks of A, let 
I := Uj^jlj, and, for all j G J, let j' be the least element of Ij. Define a linear 
map TT : —)■ by letting TT{ej) := for all j G J, where £j and e'^ 

are the standard basis elements in and labelled by j and j', respectively. 
Then 

v*C'jv = 7r(v)*A77r(v), Vv G 

so TT restricts to a linear isomorphism between kerCj and ker A/n 7 r(C'^). Since 
the matrices A and Aj have equal rank, as have G and Cj, it follows that the 
ranks of A and G are equal; note that ker A/ n 7 r(C'^) is naturally isomorphic 
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to ker Cj, and {u' : j E J}-*- is a subset of kei Aj which has trivial intersection 
with 7r(C‘^), where u'- is the natural embedding of Uj in C^. 

(2) We first establish existence of a partition satisfying (a)-(c), by induction on N. 
The result is obvious for iV = 1, so assume the result holds for all integers 
up to and including some > 1, and let A E VN+iiC). Let {Ii,... ,1^} be 
the partition satisfying properties (a)-(c) for the N x N upper-left principal 
submatrix of A, and, for each j, fix a non-negative number, denoted by aj, 
which is a G-orbit representative for all entries in the diagonal block Aj.. 

Without loss of generality, we may assume aj / 0 for all j, since if amm = 0 
then amn = o,nm = 0 for all n, by positivity. We consider three different cases. 
Case 1: Aj.^j^^+i} 0 • G) for all j. 

In this case, the partition {Ii ,..., Ik,Ik+i '■= {-^ + 1}} clearly has the desired 
properties. 

Case 2: ^ 7 ^u{Y-i-i} ^ ' G) for a single value of j. 

In this case, augmenting Ij with -|- 1 yields the desired partition. 

Case 3: ^/,u{Af-ri} ^ ' G) and ^/.u{Ar-ri} ^ 'P\i.\+i{oij-G) for distinct 

i and j. 

We show that this case cannot occur. By the induction hypothesis, there 
exists m £ Ii and n E Ij such that the 2x2 minor ^ ' G)- Now 

set a 7 v-i-i,Ar+i = by assumption, there exist g, h G G such that am,N+i = cxg 
and ttn^N+i = cth. Thus 


I^{m,n,N+l} — I 0,mn 

\ ag 

so 

0 < det AT+l} = 


^mn 

ag\ 


a hp 

ah , 

(5.2) 

ah 

a) 


a\agh 

|2 

^mn \ 



and therefore ^ ' G), which is a contradiction. 

This completes the inductive step, and existence follows. 

To prove the uniqueness of the decomposition, suppose {Ii,...,!^} and 
{Ji,..., Jfc/} are two partitions associated to a matrix A E Vn{P-)-, and sat¬ 
isfying the desired properties. Without loss of generality, assume N E n Jj, 
and Ii 7 ^ Jj. Let a := a]\f,N and note that, since Ii and Jj are distinct and max¬ 
imal, there exist m £ Ii and n E Jj such that m ^ n and A^^ ^} ^ P 2 ' G) 
as above. But then the principal minor is of the form (I5.2p . which 

is impossible, as seen previously. It follows that h = Jj and the partition is 
unique, again by induction on N. 

(3) If \cij\ = I for some i ^ j, then A/. U7, = VV* E (a • G) for some a, 

where v := (u*,CjjUp*. This contradicts the maximality of Ii and Ij, so any 
off-diagonal term Cij £ D{0, 1). □ 


Remark 5.2. It is natural to ask if a similar result to Theorem 15.II holds if we assume 
the blocks to have rank bounded above, but not necessarily by 1. This is, however, 
false, as verified by the example A = Ida: the partitions {{1,2}, {3}}, {{2,3},{I}}, 
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and {{1,3}, {2}} are all such that has rank at most 2. However, A has rank 3, 

so there is no unique maximal partition when we allow blocks to have rank 2 or higher. 

Using the maximal partition corresponding to each subgroup G C C^, we define a 
stratification of the cone Vn{^)- The following notation will be useful. 

Definition 5.3. Fix a subgroup G C and an integer > 1. 

(1) Define (n^r, -<) to be the partially ordered set of partitions of {1,... , A^j, with 
vr' ^ vr if TT is a refinement of vrh Given a partition tt = {/i,..., Ik} G Hjv, let 
|7r| := k denote the size of vr. 

(2) Given a non-zero matrix A G Vn{C), define tt^{A) G Hjv to be the unique 
maximal partition described in Theorem 15.II Also define T^^iO^xN) to be the 
indiscrete partition {{1,...,A'}}. 

(3) Given a partition vr = {/i,..., 1^} G H^r, let 

:= {A G PAr(C) : 7r‘^(A) = tt}. (5.3) 

Given a subgroup G of C^, there is a natural stratification of the cone according to 
the structure studied in Theorem ED 

VNiC)= □ 5^, 

ttGIIjv 

where the set of strata is in bijection with Hat. The following result discusses basic 
properties of these Schubert cell-type strata. 

Proposition 5.4. Fix a subgroup G C and an integer N > 1. 

(1) For any partitiomr £ Hn, the set has real dimension + {N—\'k\) dim^G, 

and closure 

U 

7r'-<7r 

(2) For any A G Vn{C), the rank of A is at most |7r^^(A)|. 

Note that G C can have real dimension 0,1, or 2. 

Proof. 

(1) Suppose vr^(A) = {Ii ,..., Ik}. Theorem l5.1l implies that a generic matrix in 

is created from a unique matrix G G with only ones on the diagonal, 

and unique non-zero vectors Uj G with Ujp G [0, oo) and G Ujq • G 
for all / > 1. Thus the degrees of freedom for A equal those for the strictly 
upper-triangular entries of G and for the Uj, i.e., 

dimK^^ = dimigC -k ^^(1 -k {\Ij \ - l)dimKG) 

= -k (A^ — A;) dimR G. 

The second observation is straightforward. 

(2) This is an immediate consequence of Theorem 15.Il f3l. □ 

The following corollary provides a decomposition of a matrix A G PAr(C) that will 
be very useful for studying the kernel of colAfxW + ciA -k • • • + 
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Corollary 5.5. Let A G Vn{C) and let G be a multiplicative subgroup of S^. There 
is a unique partition of such that the corresponding diagonal 

blocks A[. of A satisfy the following properties. 

(1) The entries in each diagonal block Aj. belong to Oj ■ G for some aj >0. 

(2) The diagonal blocks have maximal size, i.e., each diagonal block is not contained 
in a larger diagonal block with entries in aj ■ G for some aj. 

If, moreover, G = {1}, then A has rank at most k. 

Finally, if {Ii,..., Ik} is any partition satisfying (1) but not necessarily (2), then 
the entries in every off-diagonal block AI^xIj also share the property that they lie in a 
single G-orbit in C. 

Notice that the result follows from Theorem 15.II because if G C S^, then every block 
with entries in a single G-orbit automatically has rank at most one, by Theorem 15.81 
below. 

In what follows, we use Corollary 15.51 with the following two choices of the sub¬ 
group G: 

( 1 ) G = {!}, in which case all entries in each block of A are equal; 

( 2 ) G = S\ so all entries in each block of A have equal modulus. 

Remark 5.6. Remark that the diagonal blocks of A as in Corollary 15.51 mav be 1 x 1 
(for example, in the case where the diagonal entries of A are all distinct). Moreover, 
the partition of the indices as in Corollary 15.51 does not determine the rank of A. For 
instance, let cji, ..., ujn £ be pairwise distinct, and let u := (wi,... ,ujn)*- Then 
the identity matrix Htv and uu* have different ranks for N > 2, but both matrices 
correspond to the partition of { 1 ,... , A^} into singleton subsets. 

5.2. Simultaneous kernels of Hadamard powers. We now state and prove the 
main result of this section, which in particular classihes the simultaneous kernels of 
Hadamard powers of a positive semidefinite matrix. 

Theorem 5.7. Let A G Vn{C) and let {/i ,... ,Ik} be the unique partition o/{l,..., iV} 
satisfying the two conditions of Corollaru 15.51 with G = {1}. Fix B G PAr(C) with no 
zero diagonal entries, and let cq, ..., cn-i > 0. Then 

ker(H o (coIatxAt H-h = Q ker(H o A°"‘) = ker © • • • 0 ker H 4 , 

n>0 

where Bj. are the diagonal blocks of B corresponding to the partition {/i,... ,/fc}. 

Note that when the Hadamard product is replaced by the standard matrix product, 
the simultaneous kernel nn>ikerH”' equals kerH. In contrast, characterizing the si¬ 
multaneous kernels of Hadamard powers is a challenging problem. Also observe in 
Theorem 15.71 that the simultaneous kernel does not depend on cq, ..., CAr_i. 

The proof of Theorem 15.71 repeatedly nses a technical result, which we quote here 
for convenience. 

Theorem 5.8 (Hershkowitz-Neumann-Schneider, [271 Theorem 2.2]). Given an NxN 
complex matrix A, where N >1, the following are equivalent. 

(1) A is positive semidefinite with entries of modulus 0 or 1, i.e., A G PAr(S'^U{0}). 
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(2) There exist a diagonal matrix D, all of whose diagonal entries lie in , as well 
as a permutation matrix Q, such that {QD)~^A(QD) is a block diagonal matrix 
with each diagonal block a square matrix of either all ones or all zeros. 

Equipped with this result, we now prove the above theorem. 


Proof of Theorem 5.7. We begin by showing the first equality. One inclusion is imme¬ 
diate; for the reverse inclusion, let u G ker(i? o (cqItvxTV + ciA 

Then u G ker(i? o ^4°”) for 0 < n < iV — 1, since A and B are positive semidefinite. 
Applying Lemma 031 we conclude that u G ° A°”). 

We now prove the second equality. Let {Ji,..., J;} be a partition of {1,..., A"} as 
in Corollary 15.51 with G = S^, i.e., with the entries in the diagonal blocks having the 
same absolute value, instead of necessarily being constant. Clearly, {/i,...,/^} is a 
refinement of the partition {Ji,..., J/}. We proceed in three steps. We first show that 


Pi ker(5 o C p ker(Sj, o A}™) © • • • © p ker{Bj^ o A°j^). (5.4) 

n>0 m>l m>l 


We then prove that each kernel of the form ker(i?j. o further decomposes 

into keri?/,^ © • • • © keri?/.^, where J* = Iq U • • • U 1*^. Finally, we prove the reverse 
inclusion, i.e., 

k 

0 keri?,^ C P ker(i? o A°"'). 
m=l n>0 


Equation ()5.4p is obvious if / = 1, so assume I > 2. Let i satisfy an > max^^ ajj > 0 
and suppose, without loss of generality, that z G Ji. Now write the matrices A and B 
in block form: 


A = 



^22 / 


and B 


(^11 ^ 12 \ 

\Bl^ B22) 


where the (1, 1 ) blocks correspond to the Ji x Ji entries, and the ( 1 , 2 ) blocks correspond 
to the Ji X entries of the matrices, where Jf := J 2 U- • -U J/. Then, by Theorem 15.81 we 
conclude that afj^An = vv* for some v G Moreover, we claim that the entries 

of a~^Ai 2 have modulus less than one. Indeed, since the entries of the matrix a~^A 
lie in the closed unit disc L>(0,1), by a positivity argument, we can choose a sequence 
of integer powers n), —>■ 00 such that A^o '.= limfc_>.oo(a^^A)°"’'= exists entrywise. Note 
that Aoo G PAr(C), by the Schur product theorem. Now let m ^ Jf and consider 
the submatrix A' := (Aqo) which is positive semidefinite and has entries with 
modulus 0 or 1. Thus, by Theorem 15.81 A' = ww* for some w G with = 0 

or 1. By the maximality of Ji, it follows that < 1 for all j G Ji and so all the 

entries of a~^Ai 2 have modulus less than 1 . 

Now let u = (u);,U 2 )* G o A°"), where ui G U 2 G and 

||u|| = 1. We will prove that {Bn o = 0 and {B 22 ° A 22 )u 2 = 0 for all n > 1. 

To do so, fix e G (0,1) and let aj := aig{vj)/ {2tt) for j = 1, ..., | Ji|. Suppose {^o := 
1 , 01 , ... , 9p} is a Q-linearly independent basis of the Q-linear span of {1, ai,..., a| jp}- 
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Then there exist integers rrijk, and an integer M > 1, such that 


an = 




fc =0 


By Kronecker’s theorem |22L Chapter 23], since O^/M are Q-linearly independent, there 
exist sequences of integers and (pr)^i such that Ur ^ oo and \nr{6k/M) — 

{9k/M) — Pr\ < e for 0 < A: < p. It follows that, for any j, j' G Ji, 

= I exp(2Trinr{aj - af)) - exp(2TTi{aj - aj'))\ 


< 


27r^(mjfc - rrifk] 


k=0 


TlrOk 6k 

M 


Pr 


< 2-71 max \nr{dk/M) — {9k/M) — Pr\ / \'mjk — 

fc= 0 ,...,p ^ 

— 

for some constant Cjj' > 0 independent of e. Let C := maxll,^^^' : j, / G Ji}. 

Replacing by a subsequence if necessary, we may assume without loss of 

generality that converges entrywise to a limit 


Anso ■ - 


A 


CXD.II 


0 


0 , 


^ 00,22 


VflxIJil 

Passing to a further subsequence, we may also assume the entries of —^ 00,11 

are at most Ce in modulus. Moreover, 

2 

ul(Bkk o Aoo,kk}uk = ^li^u*(Ro = 0 , 

k=l 

whence (Bkk ° ^oo,A:fc)ufc = 0 for k = 1, 2. Therefore, since ||ui|| < 1, 
o ^ii)ui|| 

< aii\\{Bii o {a^^An - (aG^Aii)°^"))ui|| + ||(Rii o [{aT^AuY^- - Gloo,ii))ui|| 

< aaWBii o All - {<hi^+ ll-Bii o - ^00,11)|| 

< max \Bjk\ {an + 1 ) | Ji| Ce; 

j,keJi 

as e is arbitrary, we must have {Bu o ^ii)ui = 0. Furthermore, since u G ker B o A, so 

{Bn o ^ii)ui + {Bi 2 o Ai 2 )\i 2 = 0 , 
whence {B 12 o Ai 2 )vl 2 = 0, and 

{B12 o 74 i 2 )*ui + {B22 o ^22)112 = (-R21 o ^2i)ui + {B22 o ^22)112 = 0. 


This implies that 

0 = ((Bi2 O j4i2)u2)*Ui + U2(i?22 ° ^22)u2 = U2('®22 ° ^22)u2) 
and therefore {B22 o A22)u2 = 0 , since B22 o 2I22 is positive semidefinite. 
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Repeating the same argument, with A replaced by for some fixed m > 1, we 
conclude that 

i^kk o AkT)^k = 0 (k = 1,2, m> 1). 

Hence 

f| ker(R o H°”^) c f| ker(Rii o A^^) © f| ker(H 22 o 

m>l m>l m>l 

and we conclude by induction that 

Pi ker(R o A°^) C p ker(Rj, o AJ^) © • • • © p ker(Rj, o A}^). 

n>0 m>l m>l 

We now examine the simultaneous kernel of Hadamard powers of a non-zero diagonal 
block BJ^ o A°j^. Assume without loss of generality that |ajA;| = 1 for all j, A: € Ji, and 
that Ji = Ii U • • • U A for some integer t > 1. By Theorem 15.81 we obtain Aj. = vv* for 
some V G It follows that Aj. is itself a block matrix with 1 throughout each 

diagonal sub-block. Thus we can write 

Y — (Ailixni)''')Allxnt) and A — vv — (Aj In; xn^ )i J= 1 1 

with Ai, ..., At G 5^ pairwise distinct, and | Ji| = ni + • • • + rit. 

Now let u G nm>i ker(i?j. o Aj™) and let u = (uj,..., u*)* G C"'^ © • • • © C”* be the 
decomposition of u corresponding to the partition {/i,..., It} of Jj. Let Bjk := Bj.j^; 
we claim that Uj G kerBjj for all j. Note that 

t 

^ Afc)'”ufc = 0 Vm > 1, 

k=l 

from which it follows that 

t 

= 0 Vm > 1. 

k=l 

Thus, for fixed j, the vector {vL*-BjkWk)\.^^ belongs to the kernel of the transpose of the 
Vandermonde matrix 

V= 

Since the \j are distinct and non-zero, the matrix V is non-singular. Consequently, 
UjBj/^u/g = 0 for all j, k] in particular, u*BjjUj = 0, and therefore Uj G kerBjj. This 
completes this step and shows that 

P ker(i? o A°"') C ker Bj^ © • • • © ker Bj^. 
n>0 

We now prove the reverse inclusion. First, we claim that if C = {Cij)lj^i is a block 
matrix in Vn{C), and u = (uJ,..., u^)* G is the corresponding block decomposition 
of u, then 


u G ker Cn © • • • © ker Cu 


u G ker C. 


(5.5) 
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We prove the claim by induction on t. The base case of t = 1 is obvious. Now suppose 
(jS.Sp holds for t — 1 blocks. Let u = (uj,..., u*)* = (u'*, u*)*, with Uj E kerCjj. Then 
u*CjjUj = 0 for all j. Partition C with respect to the same decomposition: 


C 


c 

(CD* 


C't 

Ctt 


By the induction hypothesis, u'*C''u' = 0. On the other hand, for any A E M, 

0 < (u'*,Au*)C(u'*,Au*)* = 2ARe(u'*C';ut). 

This implies that Re(u'*C'^Ut) = 0, from which it follows immediately that u^Cu = 0, 
and so u E kerC. This proves the claim. 

Now, to conclude the proof, let u E kerR/^ © • • • 0 ker R/^,. Then, for all n > 0, 
u E ker(R/^ o © • • • © ker(R4 o A^^), since the entries of A are constant in each 
diagonal block Ai^. R follows by (15.51) that u E ker(i? o A°'^) for all n > 0. 

This concludes the proof of the theorem. □ 


Remark 5.9. We note the following consequence of Theorem 15.71 Given a matrix 
B E P7v(C) with no zero diagonal entries, as A runs over the uncountable set V]y{C), 
the set of simultaneous kernels 


{nn>okev{BoA°'^):AGVN(C)} 

is, nevertheless, a finite set of subspaces of . Moreover, this hnite set is indexed by 
partitions of the set {1,... ,N}. The case when B = ItvxTV; or, more generally, when 
B has no zero diagonal entries, is once again in contrast with the behaviour for the 
usual matrix powers, and provides a stratification of the cone Vn{C). 

We conclude this section by strengthening Theorem 15.71 Given an integer iV > 1, 
matrices A, B £ V^IC), and a partition tt = {Ii, ..., Ik} E IlAr, let 

k 

}C^{A, B) := 0 f| ker(R,^. o AJ^). (5.6) 

j=l n>0 

We have from the proof of Theorem 15.71 that 

^{{i,...,N}}iAB) =K,^sif^^~^{A,B) =( 5 . 7 ) 

Our final result analyzes the set of partitions vr for which Equation (15.7p holds. 

Theorem 5.10. Fix an integer N > 1 and matrices A, B £ Vn{C), with B having 
non-zero diagonal entries. Then, 

{vr E Bat : }C^{i,...,n}}(A B) = 1C^{A, R)} D {tt : vr © 7 r{i>(Al)}. ( 5 . 8 ) 

The reverse inclusion holds if B £ R^(C). 

In particular, for any subgroup G C C^, Equation (|5.7p holds with 7r'^^(j4) replaced 
by 7r^(7l). 
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Proof. Suppose (^4) = {Ii,..., } is a refinement of vr = {Ji,..., Ji}. Then, 

restricting to each Jp x Jp diagonal block, 

AX) 

n>0 

equals ){^Jp^ Bj^), by Theorem 15.71 We are now done by taking the direct 

sum of the previous equation over all p, since 

Conversely, suppose that B G P^(C) and that is not a refinement of vr. 

Then, without loss of generality, there exist indices ii and i 2 which lie in distinct parts 
of TT but the same part of The vector with zith entry equal to 

entry equal to —and all other entries eqnal to 0, lies in B), but it does 

not lie in K,Tt{A, B). □ 

6. Conclusion and survey of known results 

It is the aim of the present section to discuss, from a unifying point of view, a 
collection of old and new computations of sharp bounds for extreme critical values of 
certain matrix pencils. As mentioned in the introduction, this was a recurrent theme, 
motivated by theoretical and very applied problems, spanning more than half a century. 

In all the examples which follow, we identify a numerical evaluation of the extreme 
critical value of a concrete matrix pencil. An authoritative source for the spectral 
theory of polynomial pencils of matrices is [35]. 

Specifically, Equation (j6.ll) below provides an accessible, often computationally effec¬ 
tive, way of expressing the rather elusive C{h; g; V), which is, by definition, the smallest 
real constant C satisfying 

g[A\ < C/ijA], for all matrices A gV. 

We do not exclude above the case C = oo, which means that no uniform bound between 
g[A\ and h[A\ exists. A second general observation is the stability of the bound as 
a function of the matrix set: more precisely, quite a few examples below share the 
property 

C{h;g;V)=C{h-g-,V'), 

where V' CV is & much smaller class of matrices. 

(1) The first of Schoenberg’s celebrated theorems proved in |l2j involves convergent 
series in Gegenbauer polynomials. The result can be formulated as a matrix 
pencil critical-value problem, as follows: fix an integer d > 2, set K := [—1,1], 
and define 

Hz) = '^hnCi^Hz) and g{z) = '^gnC^^Hz), 

n>0 n>0 

where € [0, oo), G M, A = (d — 2)/2, and is the corresponding Gegen¬ 
bauer or Ghebyshev polynomial. Also let V denote the set of all correlation 
matrices with rank at most d but of arbitrary dimension. Then Schoenberg’s 
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Theorem 12.ll fl) asserts that 

C{h]g;V) = sup (6.1) 

n:gn>0 

When hn = 0 and Qn > 0 for some index n, the constant C{h]g;V) is equal 
to infinity, whence there is no uniform bound for g[A\ in terms of h[A\, when 
taken over all matrices A gV. 

(2) Schoenberg’s second landmark result from [42j, as well as its subsequent ex¬ 
tensions by Christensen and Ressel, Hiai, and others (see the references after 
Theorem I2.1ji . can also be rephrased as an extreme critical-value problem, as 
follows. Set K := {—p,p), where 0 < p < oo, and consider the convergent 
power series 

Hz) = g{z) = ^ gnz^ ; (-p, p) ^ M, (6.2) 

n>0 ri>0 

where G [0, oo) and G M for all n, and V = UAr>i Then 

C{h;g;'P) can be computed as in Equation (j6.1ji . 

(3) In the papers [3T|, BT] . Rudin, working with Kahane, proved that preserving 
positivity on low-rank Toeplitz matrices already implies absolute monotonicity. 
In this case, set K := (—1,1), and let h{z) and g{z) to be as in Equation ()6.2I) 
with p = 1. Also let V := Uam T’y((~/ 0, p)) and let V denote the set of 
Toeplitz matrices of all dimensions and of rank at most 3. Rudin showed in 
mi Theorem IV] that it suffices to test the pencil bound on the set Va ■= 
{M(a,b,n,a) : 0 < a, b, a -|- 6 < 1, re > 1} for any irrational multiple a of tt, 
where M{a,b,n,a) G 7^))([—1,1]) is the Toeplitz matrix with (j, fc)th entry 
a + bcos{{j — k)a). 

Then C(/i; p; R) = C{h; g;V') =C{h]g]Va) and Equation (j6.ip holds. 

(4) A necessary condition for preserving positivity in fixed dimension was provided 
by Horn in [30]. The condition was subsequently extended by Guillot-Khare- 
Rajaratnam [T8| and is stated in Theorem 12.21 above. This yields a special case 
of the extreme critical-value problem, with 

N-l N -1 

Hz) = ^ hnz"- and g{z) = ^ (6.3) 

n=0 n=0 

where hn, p„ G [0,oo) for all re, and V = P^((0,p)). Then C{h]g]V) = 
C(/i;p;PAr((0, p))) and Equation ()6.ip holds. 

(5) The problem of preserving positivity has recently attracted renewed attention, 
due to its application in the regularization of ultra high-dimensional covariance 
matrices. The next few observations are along those lines. First, in [TSl Propo¬ 
sition 3.17(3)], the authors consider a more general situation than the previous 
instance, where one replaces the polynomials h, g of degree at most — 1 by 
one of the following. 

• A linear combination of N fractional powers 2“, with K = (0,p) and 
V = V]^{K)-, here a can be negative. 

• A linear combination of N fractional powers z^ and the constant function 
1, with K = [0,p) and V = V}g{K)-, here 0“ := 0 for a G M. 
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• A linear combination of N fractional powers of the form 4>a{^) •= or 

:= sgn(2;)|2;|", with K = {—p,p) and V = 'P]^{K)-, here (paiO) = 

MO) := 0 . 

In each case, if Pn £ [0,oo) for all n, with both h and g involving the same 
set of fractional powers, then C{h]g;V) = C{h;g\VN{K)) and Equation (16.111 
holds. 

(6) Note that Lemma [2.4l also yields an extreme critical value that can be deduced 
from either of the two previous cases. In particular, if h and g are power 
series, as in Equation ()6.2jl . and we define V = UAr>i^Af((0,/5)) and V = 
Ujv>iP))) then C{h\g]V) = C{h]g]V') and Equation (|6.ip holds. This 
result was also shown, using alternate approaches, in m- 

(7) Another application involves entrywise functions preserving positivity on ma¬ 
trices with zero structure according to a tree graph T. Recall that for a graph 
G with vertex set {1,..., A"}, and a subset K C C, the cone Vg{K) is defined 
to be the set of matrices A = {aij) E Vn[K) such that A i ^ j and (z, j) is not 
an edge in G, then Uij = 0. An extreme critical-value phenomenon was shown 
in m-- fix powers 

0</<r<s<s'< oo, r > 1, 


as well as a measurable set B C (r, s). Also fix scalars o,.', a^, a*, a^/ > 0 and 
a measurable function I : B ^ M.. Now define 



and V = Utg 5 where 5 is a non-empty set of connected trees on 

at least 3 vertices. Then m Theorem 4.6] shows the existence of a finite 
threshold: C{h;g;'P) E (0,oo). Note that this is an existence result, in contrast 
to the sharp bounds obtained in all previous examples. 

(8) We turn now to the present paper, in which extreme critical values were ob¬ 
tained for various families of linear pencils. Eor all of the remaining examples, 
fix the following notation: A > 1 and M > 0 are integers, p E (0, oo), and 


Co, Cl,..., CN-i E (0, oo), 

hc{z) = Co CN-iZ^ 

and Vjf{{0,p)) CV C Vn{D{0,p)). 

Then the main result of the present paper, Theorem ll.il says that 


C[he , z 


M 


N-1 

z ’) = E 

j=0 


M 


^ - j - ly p^-^ 

N-j-1 


(6.4) 


Note that if M = A, R = Vn{[—p,p]), and A tends to infinity, then the 
extreme critical value grows without bound, thereby recovering Schoenberg’s 
result discussed in the second example above. 
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(9) TheM+-subadditivity ofC(/i; 5 ;P) in its second argument, as in Equation (13.191) . 
immediately yields, for a polynomial g{z) = Yln=o ^ 

C{hc\g]V)< ^ gnC{hc]z^\V), 

n-.gn>0 

where the notation is as in (|6.4p . 

(10) When g(z) = Y1 ’m=n analytic on D{0,p) and continuous on D{0,p), 

with real coefficients. Theorem [T3] yields a bound on the corresponding extreme 
critical value: 

.. 9r~^\vp) 

C{c, g, N, p) < ^/v-i/AT 102 / ^ 


2^-i(A^- 1)! 


j=0 


J 


where g+{z) = 92 {z) = g+iz"^)- 

(11) In the case of 2 x 2 matrices, fix non-negative integers m < n < p, set ^ 2 ( 2 ) = 
CmZ^ + Cnz'^, and suppose Vl{[Q,l]) C V C P2([0,1]). Then Theorem 13.101 
shows that 

r(h P V'\ = ^rnCnjn - mf 
^ ^ Cmip-mY + Cnip-ny 

(12) The next instance involves Rayleigh quotients for Hadamard powers. In this 
case, we let V be the set containing a single non-zero matrix A G Pw(C). Define 

}C{A) =ker/ic[^] = ker(colArxAf + ci^ H-h 

Then Propositions 14.ll and 14.21 show that, with notation as in (16.4p . 

v*A°^v 


C{h,;z^-A) = 


max 


= g{hc[A]^/^A°^hc[A]^/^) < 00 . 


ve52iv-inK;{A)i v*hc[A]\- 

In particular, C{hc; z^ ;uu*) = (u°'^)*/ic[uu*]fu°^ for all non-zero u € C^, 
by Corollary 14.51 

(13) Our final example involves an application of Theorem [521 which we obtained 
a stratification of the cone Vn{C) by the set IItv of partitions of {1,... , N}. 
Given a partition vr G IItv, define the stratum as in Equation (|5.3I) . The 
key observation is that the simultaneous kernel map A 1 —)• K,{A) is constant on 
each stratum. In other words, the map 

N-l 

K : Vn{C) ^ Rw ^ □ Gi{r,C^) 

r=0 

sends every matrix A G to a fixed subspace 

kl 

JC^ :=ker^l,,x/, G Gr(A^ - |7r|, O, 
i=i 


where the N x N matrix 1 _exE has (f, j)th entry equal to 1 if (f, j) & E x F and 
equal to 0 otherwise, |7r| = k denotes the number of parts in the partition vr = 
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and Gr(r,C'^) denotes the complex Grassmann manifold of r- 
dimensional subspaces of C'^. 

Now suppose h{z) is as in (I6.4|) . 

M 

9{z) = '^gnZ^ (ffn G [0,oo)), and V= |J 5°, 

Ti —0 Tr^XIjY 

where C is a compact subset of the corresponding stratum. Then 
C{hc]g-,V)< ^ gnC{hc-,z'^]V) 

n:gn>0 

= E ® ■5°}' 

and this is a finite number because the inner maximum is taken over a product of 
compact sets, and the function being optimized is continuous in both variables. 

It would be interesting to investigate the jumping locus of the map A i-A 
C{hc', z^] A). In particular, is this map continuous on the stratum 5^^^, for 
each partition tt G IIjv? 

More generally, given any matrix B G Pjv(C) with no zero diagonal entries. 
Theorem 15.71 provides a stratification Vn{^) = UTrenjv™ ^ similar vein 
to the above; thus ^ for all vr G IItv- Once again, the map K, is 

constant on each stratum, sending to ICb,^ say. Then, by Proposition 14.21 
and Lemma [4.31 the inequality in (16.51) generalizes, for each fixed B, with hc[A\ 
and replaced by S o hc[A\ and B o A°"‘ for all n, respectively. See the 
preceding section for full details. 

List of symbols. A few ad hoc notations were introduced in the text. We list them 
below for the convenience of the reader. 

• V^{K) is the set of positive semidefinite N x N matrices with entries in a 
subset K G C and of rank at most k. 

. VNiK) ■.= V^{K). 

• A°^ is the matrix obtained from A by taking the kth power of each entry. 

• IaxA is the N x N matrix with each entry equal to 1. 

• f[A] is the result of applying / to each entry of the matrix A. 

• C{h]g]'P) is the smallest non-negative constant satisfying g[A] < C{h; g;V)h[A] 
for all A G P. 

• q{A) is the spectral radius of the matrix A. 
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